计算机与现代化 ›› 2013, Vol. 1 ›› Issue (9): 46-49.doi: 10.3969/j.issn.1006-2475.2013.09.011

• 人工智能 • 上一篇    下一篇

基于LDA模型挖掘招聘信息的技术主题

施乾坤,梁正友   

  1. 广西大学计算机与电子信息学院,广西 南宁 530004
  • 收稿日期:2013-03-21 修回日期:1900-01-01 出版日期:2013-09-17 发布日期:2013-09-17

Mining Technology Topics from Recruitment Information Using LDA

SHI Qian-kun, LIANG Zheng-you   

  1. Computer and Electronic Information College, Guangxi University, Nanning 530004, China
  • Received:2013-03-21 Revised:1900-01-01 Online:2013-09-17 Published:2013-09-17

摘要: 互联网中发布着大量的招聘信息,这些信息不仅反映了公司对人才的需求,而且暗含市场中的技术热点和公司的技术主题。针对招聘信息中所蕴含的技术主题,本文提出一种将句子分类和主题抽取相结合的方法。该方法使用LDA模型对招聘信息建模,再通过SVM分类提取出与技术相关的句子,最后从技术相关的句子中抽取出技术主题,整合成招聘信息的技术主题。实验结果表明该方法能够准确地挖掘出招聘信息的技术主题,获得了良好的效果。

关键词: 文本挖掘, 技术主题, LDA模型, 招聘

Abstract: There is a huge number of recruitment information on Internet. These information not only respect the requirements from companies, but also imply the hot technologies in market and the technology topics in companies. For the technology topics implied by recruitment information, this paper proposes a method combining sentence classification and topic extraction. This method uses LDA to model the recruitment information, and then classifies the sentences about technology by SVM, finally, extracts the technology topics from the technology sentence and integrates them into the technology topics of recruitment information. Experimental results show that the method achieves good effect in extracting the technology topics from recruitment information.

Key words: mining text, technology topic, LDA, recruitment information

中图分类号: